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ABSTRACT 



PROBLEM TO BE SOLVED: To perform static single substitution when a code 
with predicate exists. 

SOLUTION: Guards placed in the individual source operands of a &phiv; 
function show conditions where the corresponding source operands are 
effective and can be precisely embodied after code reordering. The guards 
show a basic bloclc being the source of a side which is correlated to the 
source operand with respect to a control function &phiv;c showing the 
confluence of reached effective definitions in the junction of control flow 
graphs. A &phiv;c operand is made into a group with the source basic block 
of the inputted effective side along the operand. The operands are ordered 
in the phase order of blocJcs on-'^the ' operands i When a predicate code exists, 
a predicate &phiv; function &phiv;p supplies a mechanism for performing 
SSA. The &phiv; function can precisely be embodied after code reordering. 
The guard of the source operand shows the predicate where the corresponding 
operand is defined with respect to &phiv;p. 
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Method and system for correlating profile data dynamically generated from 
an optimized executable program with source code statements 

Verfahren und Anordnung zur Korrelation von Proflldaten dynazcdsch erzeugt 
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A method and system for relating profile data (700) generated by 
monitoring the execution of an optimized machine-code computer program 
(600) back to the source-l^K^uage description, (400) of the computer 
program. Logical line numbers (604) are associated with the basic blocks 
(606, 607, 608) of the intermediate-code representation (600) of the 
computer program and actual line numbers (602) are associated with each 
instruction (610, 612, 614, 616) of the intermediate-code representation 
of the computer program. During optimization of the intermediate code, 
the logical line numbers remain fixed to basic blocks, while actual line 
numbers remain fixed to intermediate-code instructions. A branch 
instruction (610) and the target of the branch instruction (612) in the 
optimized machine-code computer program or in an optimized 
assembly-language computer program (600) can be related back to 
sourcerlanguage, statements (4flO) by using the actual line number. (602) 
associated with the branch instruction (610) and the logical line number 
(604) associated with the basic block (618) that contains the target of 
the branch instruction. 
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COMPUTER ARCHITECTURE FOR THE DEFERRAL OF EXCEPTIONS ON SPECULATIVE 
INSTRUCTIONS 

ARCHITECTURE DE MACHINE POUVANT REALISER UN REPORT D* EXCEPTIONS DANS DES 
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Publication Language: English 
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Detailed Description 
Claims 

Fulltext Word Count: 8913 
English Abstract 

The inventive system (300) and method allows for software control (320) 
of hardware (330) deferral of exceptions in speculative operations (104), 
and comprises three components. The first component is processor stored 
• information (-10.7) which reflects . the code generation strategy of. 

applications and is used by hardware and the operating system to control 
exception deferral. The second component is processor stored information 



(105) set by the operating system to specify to hardware which type of 
faults should be automatically deferred. The third component is further 
processor stored information (102) which indicates to the hardware to 
defer . certain, exception caus4.ng aspects „of ^ the speculative operation^ 
while performing other non excepting aspects of the speculative 
operation. The stored information is set after the operating system 
exception handler (200) has unsuccessfully attempted fault resolution 
(213) . 

French Abstract 

Le systeme (300) de 1* invention et son precede permettent de controler 
par logiciel (320) des reports d*exceptions machine (330) dans des 
operations speculatives (1^0)^)'' et comprennent trois elements. Le premier 
element (107) est une information stockee en memoire centrale refletant 
la strategic de generation de code d * applications, et mise en oeuvre par 
la machine et le systeme d * exploitation pour controler le report 
d * exceptions . Le deuxieme element (105) est une information stockee en 
memoire centrale instauree par le systeme d * exploitation pour indiquer a 
la machine quel type d'erreur doit etre automatiquement reporte. Le 
troisieme element (102) est egalement une information stockee en memoire 
centrale qui indique a la machine de differer certains aspects de 
1* operation speculative qui sont causes d* exceptions mais de realiser en 
meme temps d'autres aspects de 1* operation speculative qui ne sont pas 
causes,. d' exceptions.. L * information stockee. est instauree apres.que l,e 
gestionnaire d* exception (200) du systeme d * exploitation ait effectue une 
tentative reussie de resolution d*une erreur (213). 
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METHOD AND APPARATUS FOR 'TRANSFERRING DATA BETWEEN A REGISTER STACK AND A 
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PROCEDE ET APPAREIL SERVANT A TRANSFERER LES DONNEES ENTRE UNE PILE DE 
REGISTRES ET UNE RESSOURCE MEMOIRE 
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English Abstract 

A computer-implemented method and apparatus for transferring the 



contents of a general register (80), in a register stack (86), to a 
location in a backing store (94) in a main memory (82) . When transferring 
the contents of a general register to a location in the backing store, 
the invention p-roposes -callect^ing - attribute bits included in each general 
register of a predetermined group of registers in a temporary collection 
register (108 or 110) . Once the temporary collection register has been 
filled, the contents of this register are written to the next available 
location in the backing store. Similarly, on the restoration of registers 
from the backing store, a collection of attribute bits saved in the 
backing register is transferred to a temporary collection register. 
Thereafter, each attribute bit is saved together with associated data 
into a general register, ther^eby to restore the former contents of each 
general register. "^^^ 

French Abstract 

L' invention concerne un precede informatique et un appareil destine a 
transferer le contenu d*un registre general, dans une pile de registre, 
vers un emplacement dans une memoire auxiliaire faisant partie de la 
memoire principale. Selon 1* invention, lors du transfert du contenu d'un 
registre general vers un emplacement dans la memoire auxiliaire, on 
recueille les bits d'attribut, contenus dans chaque registre general 
faisant partie d'un groupe predefini de registres, dans un registre de 
collecte temporaire. Une fois le registre de collecte temporaire rempli, 
le contenu de ce registre est**enregi*stre a • 1 * emplacement suivant> 
disponible dans la memoire auxiliaire. D*une facon similaire, lors de la 
restauration des registres a partir de la memoire auxiliaire, on 
transfere une collection de bits d * attribution sauvegardee dans la 
memoire auxiliaire a destination d*un registre de collecte temporaire. 
Par la suite, chaque bit d'attribut est sauvegarde avec les donnees qui 
s*y rapportent dans un registre general, ce qui a pour effet de restaurer 
l*ancien contenu de chaque registre general. 
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Coirpiler based automated error recovery method for computer application, 
involves automatically generating executable check point object code for 
recovery, if execution of program is interrupted 
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NOVELTY - Program source code is compiled into intermediate code. 
Check points identified in the intermediate program code, are 
associated with data objects. Executable check point code for execution 
at the check points, is automatically generated in which the check 
point code is configured to store state information of the associated 
data objects for recovery, if execution of the program is interrupted. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following: * - • ... 

(1) error recovery apparatus; and 

(2) error recovery program product. 

USE - For compilers in computer applications e.g. transaction 



processing application. 

ADVANTAGE - Enables automatically generating check point object 
code without having check points specified in the source code, thus 
reducing the developer's ^^k. 

DESCRIPTION OF DRAWtNG(S) - The figure shows a flowchart explaining 
automated error recovery method. 

pp; 10 DwgNo 1/4 

Title Terms: COMPILE; BASED; AUTOMATIC; ERROR; RECOVER; METHOD; COMPUTER; 

APPLY; AUTOMATIC; GENERATE; EXECUTE; CHECK; POINT; OBJECT; CODE; RECOVER; 

EXECUTE; PROGRAM; INTERRUPT 
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Program code coinpiling method, involves delineating object code segment 
sets by checkpoints in pajS^ram code and generating checkpoint code for 
execution at checkpoints to save state information of program 

Patent Assignee: THOMPSON C L (THOM-I) 

Inventor: THOMPSON C L 
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Abstract (Basic) : US 20040034814 Al 

NOVELTY - The method Involves generating a set of object code 
segments optimized at an igp,timization level. Another set of segments 
associated with the fo^ffter set is optimized at another level. 
Checkpoints in a program code delineate the object code segment sets. 
Checkpoint code is generated for execution at the checkpoints. The code 
saves state information of the program. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following : 

(a) an apparatus for compiling program code 

(b) an article of manufacture comprising a computer readable medium 
with instructions for causing a processor to perform program code 
compiling method. 

USE - Used for compiling program code for recovering from fatal 
errors encountered in e'xecUfing ' cotnpUter * program code. . . , 

ADVANTAGE - The method processes the intermediate code on a segment 
by segment basis, thereby eliminating the step of undoing the 
optimizations that introduces the errors. The method generates 
alternative code that supports recovery from a fatal program error. 

DESCRIPTION OF DRAWING (S) - The drawing shows a flowchart of an 
example process for error recovery in accordance with program 
compilation techniques . 
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Processing system for efficiently processing a program that includes 
assertion instructions, has compiler enabling selective execution of 
assertion handling code based on run time input in response to failed 
assertion test 

Patent Assignee: HEWLETT-PACKARD DEV CO LP (HEWP ) 

Inventor: DWYER L D K B; THOMPSON C L 

Number of Countries: 001 Number of Patents: 001 

Patent Family : 

Patent No Kind Date ^Kpplicat No Kind Date Week 

US 6701518 Bl 20040302 US 2000631552 A 20000803 200424 B 

Priority Applications (No Type Date) : US 2000631552 A 20000803 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
US 6701518 Bl 14 G06F-009/44 

Abstract (Basic) : US 6701518 Bl 

NOVELTY - The system has a compiler for translating the first 
function of a first program stored in a memory into the second function 
of a. second pxogram. .The compiler .enables selective execution of , 
assertion handling code based on the run time input in response to a 
failed assertion test defined by the translated assertion instructions. 

DETAILED DESCRIPTION - The first function includes assertion 
instructions, while the second function includes the translated 
assertion instructions. The compiler enables selective execution of a 
portion of the translated assertion executions based on the run time 
input. An INDEPENDENT CLAIM is included for efficiently processing a 
program that includes assertion instructions. 

USE - For enabling si^Wctive assertion of testing computer programs 
based on run time inputs. 

ADVANTAGE - Reduces the adverse impact of assertion instructions to 
processor performance to encourage programmers to include assertion 
instructions in computer programs. 

DESCRIPTION OF DRAWING (S) - The figure is a block diagram showing a 
function compiled by a compiler in efficiently processing a program. 
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Computer program data structure invariant property verification method 
involves verifying whether runtime value of data structure is consistent 



I 



with invaricuit property during execution 

Patent Assignee: LITTFIN J (L^TT-I) ; THOMPSON C L (THOM-I) 

Inventor: LITTFIN J ; THOM>SON C L 

Number of Countries: 001 Number of Patents: 001 

Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 20040015897 Al 20040122 US 2001858241 A 20010515 200415 B 

Priority Applications {No Type Date) : US 2001858241 A 20010515 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
US 20040015897 Al 6 G06F-009/44 

Abstract" (Basic) :' US '200400158§T'Ai 

NOVELTY - A code verifying whether a runtime value of data 
structure is consistent with invariant property, is automatically 
generated in response to annotation of data structure in source code. 
The runtime value is compared to invariant property in the generated 
code, during execution of program. If the runtime property is 
inconsistent with the invariant property, the program branches to 
exception handler code. 

DETAILED DESCRIPTION )i -(An INDEPENDENT CLAIM is also included for 
apparatus for verifying invariant property of data structure of 
computer program at run time. 

USE - For verifying invariant property which is range of data 
values, of data structures of computer programs in variety of 
programming languages for e.g. C and C++, at runtime. 

ADVANTAGE - Avoids the need for a developer to specify the range of 
valid addresses for code or data, since the compiler determines the 
valid range of addresses of code or data. Realizes a method capable of 
being applicable to a variety of programming languages particularly 
beneficial in compiled languages, for e.g. C and C++. 

DESCRIPTION OF DRAWING (SJ -The figure shows the flowchart 
explaining the process of verifying invariant properties of data 
structures at runtime. 
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Register allocation method in optimizing compiler, involves assigning 
variables having lifetime greater than initiation interval of source code 
programming loop, to rotating register 

Patent Assignee: SRINIVASAN U (SRIN-I) ; THOMPSON C (THOM-I) 

Inventor: SRINIVASAN U; THOMPSON C 

Number of Countries: 001 Number of Patents: 001 

Patent Family : 

Patent No Kind Date Applicat No Kind Date Week 

US 20030237080 Al 20031225 US 2002177343 A 20020619 200411 B 

Priority Applications (No Type Date) : US 2002177343 A 20020619 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
US 20030237080 Al 19 G06F-009/45 



Abstract (Basic) : US 20030237£8^0 Al 

NOVELTY - A rotating*. register is allocated for each identified 
variables having lifetime greater than initiation interval of a present 
source code prograitiming loop, when the variables are initiated within 
the prograitiming loop. A scalar register allocator connected to the 
rotating register, assigns variables outside of the programming loop, 
to an allocated and unassigned rotating register. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following : 

(1) computer readable medium storing register allocation program; 

(2 ) compiler; and 

(3) optimizing compiler. 

USE - For- allocating register in -optimizing compiler (claimed)- for 
microprocessor of computer. 

ADVANTAGE - Efficiently improves register allocation in optimizing 
compiler. 

DESCRIPTION OF DRAWING (S) - The figure shows the flowchart 
explaining register allocation method, 
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Code generation method for module scheduled uncounted loops in compilers, 
involves assigning stage predicate to each instruction in each 
speculative stage to conditionally enable or discJ^le instruction 
execution 

Patent Assignee: HANK R E (HANK-I); MORRIS D (MORR-I); SRINIVASAN U 

(SRIN-I); THOMPSON C L (THOM-^) 
Inventor: HANK R E; MORRIS ^^^^ ^SRINIVASAN U; THOMPSON C L 
Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 20030233643 Al 20031218 US 2002175375 A 20020618 200408 B 

Priority Applications (No Type Date) : US 2002175375 A 20020618 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
US 20030233643 Al 28 G06F-009/45 

Abstract • (Basic) :- US -200302'3364a Al ... 

NOVELTY - A given stage predicate is assigned to each instruction 
in each speculative stage, and the stage predicate is used to 
conditionally enable or disable the execution of an instruction during 
the prologue and epilogue execution. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following : 

(1) counted loops processing method; 

(2) fully if-converted ^oops processing method; 

(3) uncounted loops^rocessing method; 

(4) module scheduling apparatus; and 

(5) computer program product for generating codes for modules 
scheduled uncounted loops. 

USE - For generating codes for module scheduled uncounted loops in 
compilers in computer system. 



ADVANTAGE - By assigjjWg stage predicates to instructions in the 
speculative stages, unconditional and unnecessary execution of 
speculative loads and the cost of bringing in unnecessary data into the 
data cache are avoided. 

DESCRIPTION OF DRAWING (S) - The figure shows a block diagram of the 
scheduler and register allocator. 

global and loop optimizer (320) 

schedule and register allocator (330) 

global scheduler (410) 

module scheduler and register assigner (420) 
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Coit5>uter program modification method involves including prefetch 
instructions prior to instructions corresponding to cache -miss 
conditions, when frequency of recurring patterns is more than selected 
frequency 

Patent Assignee: HEWLETT-PACKARD CO (HEWP ) 
Inventor: THOMPSON C L 

Number of Countri,es : .001. Kumb^;;. of , Patents :, 001 ... 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 20030225996 Al 20031204 US 2002160476 A 20020530 200406 B 

Priority Applications (No Type Date) : US 2002160476 A 20020530 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
US 20030225996 Al 10 G06F-015/00 

Abstract (Basic) : US 20030225996 Al 

NOVELTY - Profile data including instruction and target addresses, 
loaded and stored data, and the execution counts, is generated for 
executed load and store instructions. Recurring patterns of the 
instructions corresponding to cache-miss conditions, are identified 
from profile data. Prefetch instructions are included prior to the 
instructions of the patterns, when recurring frequency is more than 
selected frequency . 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following : 

. (1) compu.ter .readable storage medium, storing computer program , 
modification program; and 

(2) apparatus for modifying computer program. 

USE - For modifying computer program by including prefetch 
instructions corresponding to cache-miss patterns. 

ADVANTAGE - Since prefetch instructions are included corresponding 
to the cache-miss condition, waiting time for acquiring stored data 
from the memory is eliminated. Hence program execution time of the 
processor, is shortened. 

DESCRIPTION OF DRAWiij)^ ('s ) - The figure shows the flowchart 
explaining the program 'modification process, 
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Program code compiling method, involves creating checkpoint code for 
execution at checkpoints to save program state information and selecting 
object code segment for execution in response to program execution error 

Patent Assignee: HEWLETT-PACKARD DEV CO LP (HEWP ) 

Inventor: THOMPSON C L 

Number of Countries: 001 Number of Patents: 001 
Patent Family: -/ 

Patent No Kind Date .^Applicat No Kind Date Week 

US 6658656 Bl 20031202 US 2000702592 A 20001031 200405 B 

Priority Applications (No Type Date) : US 2000702592 A 20001031 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
US 6658656 Bl 9 G06F-009/45 

Abstract (Basic) : US 6658656 Bl 

NOVELTY - The method involves generating two sets of object code 
segments optimized at two optimization levels, and identifying 
checkpoints in a program cocle during * compilation . Checkpoint ' code xs 
generated for execution at the checkpoints to save state information of 
the program. One object code segment is selected from among the code 
segments for execution in response to a program execution error. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following: 

(1) an apparatus for compiling program code 

(2) a computer program product to cause a computer to perform 
program code compilation .y^ v 

USE - Used for compiling a program code. 

ADVANTAGE - The method creates alternative versions of code 
segments and dynamically substitutes execution of the alternative code 
versions . 

DESCRIPTION OF DRAWING (S) - The drawing shows a flowchart of a 
process for error recovery based on program compilation, 
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Register access management method for use in microprocessor, involves 
determining whether register referenced by instruction is within current 
register stack frame, based on which instruction is executed 

Patent Assignee: INST DEV EMERGING ARCHITECTURES LLC (EMER-N) 



Inventor: COUTANT C A; ROSS J K; THOMPSON C L ; ZAHIR A R 
Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 6665793 Bl 20031216 US 99473820 A 19991228 200404 B 

Priority Applications (No Type Date) : US 99473820 A 19991228 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 

US 6665793 Bl - 11 GG6P-009/^312 • - ■ - . - , . . 

Abstract (Basic) : US 6665793 Bl 

NOVELTY - The method involves determining whether a register 
referenced by an instruction is in a current register stack frame. The 
instruction is executed to write data into the register or to read data 
from the register, when the register is determined to be within the 
current register stack frame. 

DETAILED DESCRIPTION r-^NDEPENDENT CLAIMS are also included for the 
following: 

(1) register access management apparatus; and 

(2) microprocessor. 

USE - For managing access to register used in microprocessor 
(claimed) . 

ADVANTAGE - Prevents a program executing on microprocessor from 
overwriting registers that are in stack frames. 

DESCRIPTION OF DRAWING (S) - The figures show the flowchart 
illustrating the register access management process, 
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Coznputer program code executing system, has processing circuitry to 
execute unconditional branch instruction based on mode indicator when 
code is disabled and refrains from instruction, and executing code when 
code is enabled 

Patent Assignee: HEWLETT-PACKARD DEV CO LP (HEWP ) 

Inventor: HUCK J; THOMPSON C L 

Number of Countries: 001- N-umbes?- of • Patents 001 ... 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 6654877 Bl 20031125 US 2000644435 A 20000823 200382 B 

Priority Applications (No Type Date) : US 2000644435 A 20000823 
Patent Details; 

Patent No Kind Lan Pg Main IPC Filing Notes 
US 6654877 Bl 10 G06F-0^*9/44 

Abstract (Basic) : US 665487'7 Bl 

NOVELTY - The system has a processing circuitry (32) that receives 
run time data indicating whether a set of codes is enabled or disabled. 
A value of a mode indicator (71) is set based on the data. The 
circuitry executes an unconditional branch instruction based on the 
indicator when the code is disabled and refrains to execute the 
instruction based on the indicator. The circuitry executes the set of 
code on enabling. 



DETAILED DESCRIPTION - An INDEPENDENT CLAIM is also included for a 
method of selectively executing sets of code in computer programs. 

USE - Used for selectively executing sets of code in a computer 
program. 

ADVANTAGE - The presence of the disabled code in the program does 
not affect the performance of the program since the execution of the 
code is based on the status of the mode indicator. The circuitry 
executes unconditional branch instruction based on the mode indicator, 
thereby preventing execution of the set of code when the code is 
disabled, 

DESCRIPTION OF DRAWING (S) - The drawing shows a block diagram of a 
processing circuitry in a computer system. 
Processing circuitry (32) 
Mode indicator (71) 
Instruction dispersal unit (75) 
Control circuitry (77) 
Pipelines (79) 
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Coiciput:er system e.g. portable coinputer system, has processor which 
selects specific branch instruction identifier based on value of mode 
indicator so as to branch processing 

Patent Assignee: HEWLETT- PACK|^RD DEV CO LP (HEWP ) 

Inventor: HUCK J; THOMPSON^*?: L 

Number of Countries: 001 Number of Patents: 001 
Patent Family: 

Patent No Kind Date Applicat No Kind Date Week 

US 6643769 Bl 20031104 US 2000644315 A 20000823 200377 B 

Priority Applications (No Type Date) : US 2000644315 A 20000823 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
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Abstract * (Basic) :' US '6643769 bI'^' 

NOVELTY - A processor (32) executes the instruction of memory based 
on the address identifiers included in the branch instructions. A 
specific identifier is selected based on the value of mode indicator 
(55) during execution of branch instructions. The processing is 
branched using the identifiers, in response to the enabling and 
disabling of codes. 

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is also included for 
selective computer code e)ce<cution method. 

USE - Computer system e.g. portable computer system. 

ADVANTAGE - Execution of computer code is either enabled or 
disabled by controlling the value of mode indicator thereby flexible 
validation is achieved. Flipping operation is performed quickly without 
delaying the execution of branch instruction thereby quickens 
validation time. 

DESCRIPTION OF DRAWING(S) - The figure shows the block diagram of 
computer system, 
memory (29) 



processor (32) 
local interface (36) 
input unit (38) 
mode indicator (55) 
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Code characteristic graphic representation providing method involves 
displaying unique graphical indicator along with block of code in 
program, to indicate presence of instruction characteristic in block of 
code 

Patent Assignee: COUTANT C A (COUT-I); THOMPSON C L (THOM-I) 

Inventor: COUTANT C A; THOMPSON C L 

Number of Countries: 001 Number of Patents: 001 

Patent Family: * ' • . - . w. ... 

Patent No Kind Date Applicat No Kind Date Week 
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Abstract (Basic) : US 200301W?^57 Al 

NOVELTY - A block of code in a program, is acquired and analyzed 
for an instruction characteristic. An unique graphical indicator 
generated for the instruction characteristic, is displayed along with 
the block of code to indicate the presence of instruction 
characteristic in block of code. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following : 

(1) code characteristic graphic representation providing system; 

and 

(2) computer readable medium for storing code characteristic 
graphic rep'resentation .* * ' ' * " ' ... 

USE - For producing graphic representations of code characteristic. 

ADVANTAGE - Conveys information to a developer/programmer about the 
characteristics of the optimized code . 

DESCRIPTION OF DRAWING (S) - The figure shows a flowchart 
illustrating utilization of optimized code. 
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Information organizing method for data mining purposes in electronic 
information &ys.tem&, cozc$>rises. providing a. visualization of actor 
communications in the context of one or more discussions 

Patent Assignee: CATAPHORA INC (CATA-N) 

Inventor: CHARNOCK E B; ROBERTS S L; THOMPSON C 
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AU 2003207856 Al G06F-017/60 Based on patent WO 200367497 

Abstract (Basic) : WO 200367497 Al 

NOVELTY - A method of organizing information comprises providing a 
visualization of actor (310) communications in the context of one or 
more electronic paper trails, or discussions (305) , where a discussion 
includes at least one actor and at least one documented communication. 
A documented communication may be one or more of, a document, an email, 
an instant message, a facsimile, a voicemail, a phone call, a wire 
transfer, a fund transfer or an electronically traceable package. 

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is also included for an 
apparatus to present data. 

USE - For data mining purposes in electronic information systems, 
by visualizing the relationships among and enabling retrieval of one or 
more groups of documents satisfying a user-defined criterion or set of 
criteria . 

ADVANTAGE - The visualization of actor communications in the 
context of one or more discussions enables a search technique that can 
.return sets, of related .documents that. .are not merely grouped, by textual 
similarity, but also grouped and sequenced according to the social 
context in which they were created, modified or quoted, thus making it 
possible to retrieve a very precise set of documents from a large 
corpus of data using relatively simple search queries. 

DESCRIPTION OF DRAWING (S) - The figure is a block diagram of 
navigation flow in one embodiment of an apparatus to visually present 
discussions for data mining purposes. 
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Team zneinber profile for collaboration among geographically -dispersed team 
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participating in different types of communication 
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Abstract (Basic) : US 20020078150 Al 

NOVELTY - The team member profile comprises communication 
information defining the preferences of the respective team members for 
participating in each of ^h/e different types of communication. 

DETAILED DESCRIPTION^- An INDEPENDENT CLAIM is included for method 
of enabling a team member to select a profile. 

USE - For collaborating among geographically-dispersed team members 
using a distributed application. 

ADVANTAGE - Enables automatically using the communication 
preferences of the users to control the setup of the communication 
sessions in which the user is participant. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
the collaboration service implementing system. 
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Abstract (Basic) : US 20030023663 Al 

NOVELTY - The apparatus identifies prefetch regions in a memory and 
corresponding critical memory references, during compilation of a 
computer program. The references correspond to data needed in a cache 
memory, if a context switch occurs from a process or thread of one 
context to another. The data associated with the critical memory 
references, are prefetched and stored before the switched context is 
resumed. 

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is included for data 
prefetch method. 

USE - For enabling compiler to prefetch data for storage in cache 
memory, during context switch in computer system. 

ADVANTAGE - Since the data is prefetched and stored in the cache 
memory prior to resuming from the context switch, the data is ready to 
be referenced efficiently when the thread or process resumes. 
Therefore, omission of cache data is 'prevented and bandwidth' of the 
computer system is efficiently utilized. 

DESCRIPTION OF DRAWING (S) - The figure shows a flowchart 
illustrating the data prefetch method in the event of context switch. 
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Abstract (Basic) : JP 2002297399 A 

NOVELTY - A guard expression is introduced into each source operand 
with respect to a control command or a function (phip) in a specific 
order. A copy of each source operand is inserted in a target variable 
in the same* order, to materialize "the" static single substitution ('SSA) 



function (phi) . Each copy is described by the guard expression with the 
order related to source operand. 

USE - For conversion technique of common sequence of computer. 

ADVANTAGE - Performs the static single substitution easily in a 
short duration using a code with the specific function. 

. DESCRIP.TIPN or DBAVilNGjta) --The figure shows the source code and 
its control flow diagrammatic chart, 
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AU 200215738 A G06F-017/60 Based on patent WO 200250723 

Abstract (Basic) : US 20020076025 Al 

NOVELTY - A request ^jjilb^s age for inviting a specific person to join 
a communication session, is sent from the user (4a-4d) to a 
collaboration service suite (2) . The suite determines suitable 
communication media for the objective person, based on restored dynamic 
presence and availability information of the person, and transmits the 
request to the person using the selected media. 

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is included for virtual 
communication session invitation system. 

USE - For inviting a person to join a virtual communication session 
for applications such as customer relation management, in a 
institutional information exchange, distributed health care 
administration, hotline and,J>elp line ..services , and other business^ 
political academic applications through wireless telephone, personal 



digital assistant (PDA), 2-way payer, PC, telephone of the person. 

ADVANTAGE - Provides a user friendly invitation mechanism, by 
automatically selecting communication media for the purpose. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
interaction between a collaboration services and members. 

Collaboration service suite (2) 

Users {4a-4d) 
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Abstract (Basic) : US 20020075306 Al 

NOVELTY - A person is enabled to interact with a virtual team 
environment, by a graphical interface to select one of the different 
communication types and one of the personal identifiers. The 
.communication, session Ls initiated using. the selected communication 
type and personal identifier, 

USE - For initiating communication including one-way messaging, 
two-way messaging, voice, multimedia, video conferencing, etc. 

ADVANTAGE - Flexibility of virtual team environment (VTE) provides 
power tool to facilitate communications collaboration. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram 
schematically illustrating communication between the person and the 



VTE. 
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Abstract (Basic) : US 20020075305 Al 

NOVELTY - The virtual team environment (VTE) client accesses and 
displays the preference and presence information relating to each 
member of the team maintained by the collaborative services suite (2) 
in response to request from one team member. The VTE client initiates 
specific type of communication selected by one team member, with other 
member . 

USE - For virtual team environment. 

ADVANTAGE - Provides user friendly, intuitive mechanism that 
facilitates collaboration among member of geographically dispersed 
team, graphical user interface displays dynamic preference and presence 
information for each member of the team, so that the user has awareness 
of each of the other team members, 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
interactions between collaboration service suite and team members. 



Collaboration service suite (2) 
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Abstract (Basic) : US 20020075304 Al 

NOVELTY - An interface displays dynamic presence and availability 
information and allows a team member to request for initiating a 
communication session with another team member to a collaboration 
services suite. A collaboration services suite application (2) 
establishes a communication session between several team members using 
a communication media in response to the request from the team member. 

USE - Distributed application for facilitating collaboration 
between geographically dispersed team members for business, academic, 
political or 'other sCci'al t^asoAs." AlSo for applications such as 
customer relation management, institutional information exchange, 
distributed healthcare administration, hot-line and help-line services. 
For communicating using wireless personal digital assistant (PDA), 
wireless telephone, two-way pager device, personal computer. 

ADVANTAGE - A user-friendly, intuitive mechanism is provided that 
facilitates seamless two-party and multiparty communication among 
members of a geographically dispersed team, without the user requiring 
to have prior knowledge oif ^network addresses or communication setup 



procedures . 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram 
schematically illustrating interactions between the collaboration 
services suite and team members. 

Collaboration services suite application (2) 
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Abstract (Basic) : US 20020080387 Al 

NOVELTY - A printer (10) in response to an user request, records a 
requested item on a paper. For each item requested to be recorded, the 
printer generates a cover sheet (20) containing human readable marks to 
identify requesting user, the item recorded and a knowledge management 
service (12) associated with the requested item. The service (12) 
stores a record of user rec^uest with an electronic version of requested 
item in a repository (68«j.v 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are included for the 
following : 

(1) Information management method; 

(2) Recommendation generation method; and 

(3) Cover sheet. 

USE - Knowledge management system employing paper user interface 
system e.g. Xerox flowport system for providing document related 
services and recommender services . 

ADVANTAGE - Provides a cover sheet which becomes an input, output 
and surrogate document mechanism enabling access to both document 
repositories and as soci-ated*-knowl edge -management services easily which 
enables the users to rate their printed document to hold a quantitative 
measure of its value. 

DESCRIPTION OF DRAWING (S) - The figure shows a block diagram of the 
knowledge management system. 

Printer (10) 

Knowledge management service (12) 
Cover sheet (20) 
Repository (60) . ^ 
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Abstract (Basic) : US 20020075303 Al 

NOVELTY - A collaboration services suite (2) enables communications 
between the members of a team using different communications media. A 
GUI enables team member to view communication icons showing presence 
and availability data of different communication devices associated 
with the respective team members and initiates a request for 
communication session with another member of the team by selecting one 
of icons associated with other member. 

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is included for virtual 
team environment establishing method. 

USE - For facilitating collaboration among geographically dispersed 
team members . 

ADVANTAGE - Provides dynamic preference and round-the-clock 
presence information to permit transparent communication sessions among 
team members. The integration of text, voice and multi-media 
communications facilitated in a single, seamless virtual team 
environment provides a powerful tool to facilitate collaboration in a 
manner that closely simulated physical collocation of team members. 

DESCRIPTION OF DRAWING(S) - The figure shows the block diagram 
illustrating the intera^)LG<ns between the collaboration services suite 
and members of a team. - 



Collaboration services suite (2) 
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Abstract (Basic) : WO 200235357 A2 

NOVELTY - A complex object is reflected to determine fields 
associated with complex object; 'The Java ' serialization is used' to 
serialize the complex object to allocate memory space for the object. 
The instances of the test object are created and initialized with the 
test data to load test the complex object. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are included for the 
following : 

(1) Enterprise test system; 

(2) Enterprise system .testing method; and 

(3) Computer program ||jiioduct for testing an enterprise system. 
USE - For synthesi?JT\g complex objects using distributed computing 

for enterprise application system. 



ADVANTAGE - The need for code compilation when generating test data 
for component under test, is eliminated by utilizing Java reflection. 
Simplifies testing of enterprise systems by setting the context for 
component under test. Tla^Vt'est object can be load tested with user 
provided data, without 'the need to invoke the method to construct the 



DESCRIPTION OF DRAWING (S) - The figure shows the top level block 
diagram of system testing enterprise system, 
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Information provision in 'e-mail glosser, involves using ranked 
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expression to present information about sequence to user in 
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Abstract (Basic) : US 6393389 Bl 

NOVELTY - A subexpression of a multitoken expression in a natural 
language is received and a' choice is obtained for translation of asset 
of subexpression to ano^Wi^ language. A subset of the subexpressions 
translation choices ar^ ranked and the ranked choices are used to 
produce a translation choice sequence. The information about the 
sequence indicating the meaning of the multitoken expression is 
presented to the user in an understandable language. 

DETAILED DESCRIPTION - Glossing is a machine automated translation 
that can include multiple translations and ambiguities 

INDEPENDENT CLAIMS are included for: 

(1) Information providing machine; 

(2) Article of manufacture comprising recorded medium storing 
information providing program; 

. (3) Data .transfer methpci, ... 

USE - For providing information about meaning of multitoken 
expression in understandable language in glossing web browser, e-mail 
glosser, news glosser, subtitle glosser and speech translator. 

ADVANTAGE - The complexity for appropriately indicating the chosen 
translations of subexpressions are eliminated by ranking the 
subexpression translation choices to produce a sequence of translation 
choices. An entire multitoken expression is handled rather than a 
single word to form a sequence of translation choices which indicates 



object . 




DESCRIPTION OF DRAWING (S) - The figure shows a schematic flow 
diagram for obtaining a sequence of translation choices for a 
multitoken expression and presenting the information indicating the 



meaning of the expression in an understandable language, 
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Data transfer in microprocessor, involves transferring data portions in 
on-chip register to off -chip memory resource and to successive collection 
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Abstract (Basic) : US 6314513 Bl 

NOVELTY - The data portions stored in an on-chip register (106) is 
transferred to a off-chip memory resource (122) and to a collection 
register. If the collection register contains a predetermined quantity 
of stored data, the sto^yi -data is transferred to next collection 
register. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following: 

(a) Content restoring method from a memory resource to a register; 

(b) Content transferring apparatus between register stack and a 
memory resource; 

(c) Processor; 

(d) Computer system 

USE - Used in microprocessor for transferring data between a 
register stack and a memory resource and also in digital signal 
processors, graphic and video processors or network processors. 

ADVANTAGE* - The data are easily and efficiently transferred between 
a register stack and a backing store in the memory resource. 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of a 
microprocessor that incorporates a register stack. 

On-chip register (106) 

Off-chip memory resource (122) 
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Customizable electronic invoice system for use on a data processing 
server-client network with optional security and sender only access to 
grant authority to modify invoice fields 
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Abstract (Basic) : US 6282552 Bl 

NOVELTY - Customizable electronic invoice system with optional 
security and a number of rules for use on a data processing 
server-client network comprises a document generator (210) connected to 
a number of clients where one or more senders only have the ability to 
alter the user invoice (217) fields or define access for client/users 
to the original document. Also provides tracking of any changes made by 
recipients to client preferred files. 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are included for the 
methods used by the data processing system for client and server use. 

USE - The- sys-tem*is- used- to -provide a customizable electronic • 
invoicing service over a client-server network. 

ADVANTAGE - The system provides a method for users , and senders to 
view both the original document and any subsequent changes made before 
receipt and the ability to authenticate any signatures included with 
the modifications. 

DESCRIPTION OF DRAWING (S) - The drawing shows a functional block 
diagram of components for the billing processing system. 

Bill/Document generator^ (210) 

Original user bill/ctecument (217) 
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Web- site end-user assessment system for analyzing moods of end-users of 
web-sites relative to task a^phs of web-sites, in which problems 
associated with end-user Acsods are fed back to business knowledge users 
of web- site 
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Abstract (Basic) : WO 200127850 A2 

NOVELTY - Task models 'and graphs for a web-site are constructed and 
various data collectors >^Wl^lect data regarding end-user behavior on the 
web-site. The data is then sorted and filtered, and information 
obtained regarding end-user success on the various tasks that can be 
performed using the different pages on the web-site. Using this 
information, the task model of the web-site may be augmented with new 
edges which reflect unexpected end-user path segments. 

DETAILED DESCRIPTION - The system for assessing a state of an 
end-user of a web-site includes a data collector for collecting data 
regarding the behavior or an end-user on the web-site, and a task model 
creator for creating a task model and a task graph for a number of 
tasks on the web-site. A weighting mechanism weighs the data collected 
by the data cpllectojc . A h^u^istias dete^rmination mechanism assesses 
the state of the end-user, based upon the data collected by the data 
collector and the weights assigned to the data by the weighting 
mechanism. INDEPENDENT CLAIMS are included for; a method for assessing 
a state of an end-user of a web-site; a method for assessing a 
satisfaction level of an end-user web site. 

USE - Analyzing and interpreting customer behavior at e-commerce or 
other goal orientated web-sites. 

ADVANTAGE - Enables de'termination of mood, states of end-users, 

DESCRIPTION OF DRAW^J^C's) - The drawing shows a system in 
accordance with the invention. 

Task model creator (110) 

Data collector (120) 

Weighter (130) 

Heuristics determinator (140) 
Report generator (150) 
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Method of displaying customer states at a Website for analyzing customer 
behavior by displaying graphical representations of tasks performed by 
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Abstract (Basic) : WO 200127*801 A2 

NOVELTY - A high level graphical representation of a website may be 
presented as a campus of buildings each building corresponding to a 
given task performed by customers on the website. A floor plan view of 
each task represents the steps of the task as rooms within the 
building. Certain tasks can be divided into sub-tasks which are 
represented as multiple floors within the building. Customers are 
represented graphically within the rooms in the buildings, and may be 
represented by the location, mood and actions of animated characters. 

DETAILED DESCRIPTION - An INDEPENDENT CLAIM is included for a 
system for *di's playing cust6ftt6r ' states "at ' a website. 

USE - In analyzing customer behavior at a website. 

ADVANTAGE - Provides an easy and understandable way of presenting 
customer states . 
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Abstract (Basic) : US 606511^A 

NOVELTY - The occurrence of an interrupt command is determined. A 
first register (IFM) configured to store a content of a second register 
(CFM) that is configured to store control information related to a 
first portion of a register stack of the microprocessor is invalidated. 
The necessity for an interrupt handler to access the register stack is 
determined . 

DETAILED DESCRIPTION - If the interrupt handler needs to access the 
register stack, the first register is validated, the content of the 
second register is copied to the first register, and the register stack 
engine of the processor is made to exchange information between the 
•first and s'econd portions Of" th6 register stack. An INDEPENDENT CLAIM 
is also included for an execution method of a cover instruction to 
synchronize an interrupt handler with a register stack engine. 

USE - For switching contexts in processor with register stack. 

ADVANTAGE - Uses excess processor memory bandwidth to dynamically 
spill or fill registers from stacked register file to a backing store 
in a memory concurrently with program execution. Provides a way of 
synchronizing spilling and filling of registers with a processor's 
execution of instructionsi v^en a switch from a source to a target 
context is required to .itfake possible a return to the same context and 
resume operation in the source context as if no context switch 
occurred. Provides way of saving and restoring, in an efficient manner, 
the contents of stacked registers of the stacked register file upon 
interrupt and return from interrupt, respectively. 

DESCRIPTION OF DRAWING (S) - The figure shows a block diagram of a 
microprocessor in which the computer-implemented method of switching 
contexts is applied. 
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Abstract (Basic) : US 6065880 A 

NOVELTY - A microprocessor (38) is electrically coupled to an 
interface connector (36T**that can be removably electrically coupled 
with the PDA (12) . A light source (44) is configured to emit a light 
beam. A photo detector (56) is electrically coupled to the 
microprocessor . 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are also included for the 
following : 

(a) an electrical apparatus; 



(b) and a PDA. 

USE - For personal data assistant. 

ADVANTAGE - Allows information to be easily downloaded into host 
computer without having to physically electrically couple PDA to host 
computer. Allows quick input of data to PDA. Enables automatic input of 
bar code readings into PDA, thereby enabling PDA to be effectively used 
for monitoring and controlling inventory or other products on which bar 
codes can be positioned. Minimizes size of PDA since adapter is 
removable . 

DESCRIPTION OF DRAWING(S) - ,The figure is the block diagrams of the 
components of the adapter and the components of a computer. 
PDA (12) 

Interface connector (36) 
Microprocessor (38) 
Light source (44) 
Photo detector (56) 
pp; 10 DwgNo 2/5 

Title Terms: PERSON; DATA; ASSIST; LIGHT; SOURCE; EMIT; LIGHT; BEAM; PHOTO; 

DETECT; ELECTRIC; COUPLE; ,^C?ROPROCESSOR 
Derwent Class: TOl; WOl 

International Patent Class (Main) : G06F-013/00 
File Segment: EPI 



5/5/37 (Item 31 from file: 350) 

DIALOG (R) File 350: Derwent WPIX 

(c) 2004 Thomson Derwent. All rts . reserv. 



013180801 **Image available** 

WPI Acc.No: 2000-352674/200031 

XRPX Acc No: NO'O-2642'83 • - • 

Method for relating profile data collecting during execution of optimized 
computer program back -to -the source language description of computer 
program by relating each accumulated count to actual line number of a 
branch instruction 

Patent Assignee: HEWLETT-PACKARD CO (HEWP ) 

Inventor: BUZBEE W B; RUSCETTA M A; THOMPSON C L 

Number of Countries: 026 Number of Patents: 004 



Patent Family: 



Patent No Kind Date - Applicat No Kind Date Vieek 

EP 1004961 A2 20000531 EP 99308943 A 19991110 200031 

US 6275981 Bl 20010814 US 98190994 A 19981112 200148 

EP 1004961 Bl 20030730 EP 99308943 A 19991110 200356 

DE 69909945 E 20030904 DE 609945 A 19991110 200366 

EP 99308943 A 19991110 



Priority Applications (No Type Date) : US 98190994 A 19981112 
Patent Details: 

Patent No Kind Lan Pg Main IPC Filing Notes 
EP 1004961 A2 E 26 G06F-009/45 

Designated States (Regional) V'aL ' AT BE^CH'cY DE DK ES FI FR GB GR IE IT 

LI LT LU LV MC MK NL PT RO SE SI 
US 6275981 Bl G06F-009/45 
EP 1004961 Bl E G06F-009/45 

Designated States (Regional) : DE FR GB 
DE 69909945 E G06F-009/45 Based on patent EP 1004961 

Abstract (Basic) : EP 1004961 A2 

NOVELTY - The method ^iVvolves relating each accumulated count to 
the actual line number 'oi a branch instruction associated with the 
count and the logic line number of the basic block that contains the 
target instruction associated with the count. 

USE - In run-time performance analysis of computer software 
programs compiled by optimizing compilers for correlating tun time 
profile data collected during execution of an optimized machine code 
version of program with the source language program from which the 
machine code version of the program is generated. 



ADVANTAGE - Provides improved method of collecting data. 

DESCRIPTION OF DRAWING (S) - The drawing shows a control flow graph 
that represents the intermediate level assembly language version of the 
routing * skewed*. 
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Abstract (Basic) : WO 9919794 A 

NOVELTY - A device is used for communicating second information. 



between an operating system and an application, about whether the 
operating system will attempt to recover from the exception prior to 
deferral. A device is used for communicating third information, between 
the operating system and the hardware, that indicates whether the 
hardware should defer the exception based upon at least one of the 
first information and the second information, and whether the 
instruction is speculative. DETAILED DESCRIPTION - One of the bits is 
the . instruction t.ransl^.tiop,^loo]j-asi4§ byffer (TLB) (331) entry caJLled 
ITLB.ed (the ed is for exception deferral). This bit controls whether 
any exceptions on any speculative loads contained in the present page 
can be deferred in hardware. By defining a bit (321) in the TLB entry 
to control hardware exception deferral, different software modules 
(which map to different pages in memory) can set this bit independently 
thus allowing each module to independently include recovery code. The 
inventive mechanism also uses multiple bits in a control register (DCR) 
(332) . The preferred embodiment is to have one bit per exception type. 
Other mappings of bits ^Wxception types are possible, e.g. a single 
bit controlling multiple exception types. Given the one-to-one 
correspondence between OCR bits and exception types, the operating 
system (OS) (320) has the flexibility to select (322) which exception 
types can be deferred in hardware and which ca nnot. 

USE - The invention applies to any type of speculative instruction 
used in operating system control of a hardware deferral of an exception 
that has occurred in execution of an instruction in application. 

ADVANTAGE - The invention enables a mechanism to defer exceptions 
on speculative instructions that applies to as many forms of 
speculation as possible. The mechanism must posses very low latency 
otherwise the, per.forrtiaace 9;^. a . pro.gr^in cpmpiled with speculation m^y 
actually be lower than the same program compiled without speculation. 
The mechanism must also place minimal restrictions on the form and the 
construction of software in order to allow the execution of legacy 
software, to minimize the impact on software developers, and to 
maximize the range of software implementation choices. The mechanism 
allows the computer system to dynamically adapt to program behaviour in 
order to maximize performance over the broadest possible range of 
software DESCRIPTION OF DRAWING (S) - The drawing a schematic diagram 
with system implementing ^f/e ration scheme according to the present 
invention. (320) operating system (OS) ; (331) instruction translation 
look-aside buffer (TLB) ; (332) control register (DCR) . 
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Abstract (Basic) : WO 9917207 Al 

NOVELTY - The apparatus transfers data from a register stack (86) 
to memory and has a temporary storage facility (108 or 110) . Transfer 
logic transfers N bits of 1st register contents in the stack to a 
memory 1st location, and M bits to 1st location in the temporary 
facility. When preset number of data transfers have occurred logic 
transfers temporary facility accumxilated' contents to 2nd location In 
memory. 

USE - For providing a method and apparatus for transferring data 
between a register stack and a backing store defined in the memory of 
the microprocessor. 

DESCRIPTION OF DRAWING (S) - The drawing shows a schematic 
representation of apparatus for transferring data between a register 
stack and a memory resource. 

register stack (86) tj 

temporary storage f^fellities (108 and 110) 
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ARRANGED RANGE CHECK CODE, METHOD AND DEVICE FOR ELIMINATING 

UNNECESSARY ARRANGED RANGE CHECK / METHODS FOR SELECTING, MODIFYING, 
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ABSTRACT 

PROBLEM TO BE SOLVED: To remove unnecessary arranged range check by 

collecting arranged range >' check information by using data flow 

analysis , etc., and performing move up of the check / etc. 

SOLUTION: C-GEN[B] is collected according to a specified condition in the 
reverse order of execution in respective basic blocks B (110) and 
information about the C- GEN[B] is stored in a storage device. And the 
C-GEN[B] is propagated as adding necessary correction in the order of post 
order traversal for priority search of depth, C-IN [B] of the respective 
basic blocks B (C120) is generated and information about the C-IN[B] is 
stored in the storage device. Finally, a check code for versioning and an 

execution code in respective execution states are generated in front of 
a loop by -using -the.C-IN[B] (130).. The .check code and the. execution 

code are stored in the storage device for execution . Thus, the arranged 

range check in the loop is removed. 
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ABSTRACT 

PROBLEM TO BE SOLVED: To generate a conversion program to optimize 
calculation by discriminating the repetition range of a loop when an 
arithmetic operation to use^ an array element that an adjacent process has 
exists in the loop arid to generate a conversion program to make a 



cornmunication between an execution statement in the loop and the array 
element of the adjacent process at the same time. 



SOLUTION: A parallel compiler 101 consists of a syntax analysis part 102, 
a data division analysis )■ part 103, a calculation division part 104, a 
communication generation .part 106, a conversion program generation part 
107, a calculation division optimization part 105 which optimizes the loop 

by dividing it into a part to use the adjacent array element and a part to 
use only data in its process, and performs code conversion for 

performing the transfer of data and the execution of a loop main body 
in parallel. 
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ABSTRACT 

PROBLEM TO BE SOLVED: To accelerate the processing speed of a parallel 
program by generating an optimum 'parallel ' program into which a code' that 
decides date division information at the time of executing is inserted. 

SOLUTION: A data dividing part 6 contains a dynamic data division deciding 
part 7 which detects a dynamic variable that is related to the decision of 
data division from an intermediate word 91 and data division information 
92, performs a dynamic data division analysis and outputs dynamic data 
division deciding information 96. It further contains a dynamic data 
division deciding code generating part 8 which inputs the information 96 
and generates a dynamic , 'data division deciding code. It detects a 
declaration measure of each data in a sequential executing source program 
1 that is inputted in the part 6, the action range of a loop control 
variable and the dynamic information among reference patterns. A parallel 
program into which an execution time deciding code is inserted is 
generated by recording a part that dynamically decides in the information 
92 and generating the execution time deciding code to the part that 
dynamically decides in the information 92. 
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ABSTRACT 

PROBLEM TO BE SOLVED: To gather and display the dynamic tuning information 
of only a pertinent tuning range and to support the tuning operation of a 
program by indicating a range to perform the tuning operation of a 
source program. 

SOLUTION: For a source program 101, a tuning range indication part 102 
generates a tuning range specified source program 103 and a compiler 104 
generates tuning range information 105. A tuning control part 110 and a 
tuning information gathering* part • 111 execute a measurement program 106 
while referring to the tuning range information 105 and gather the tuning 
information 112. After execution is ended, a tuning display part 113 

performs editing by a tuning range unit and a loop unit and performs 

visible display on a display screen 108. 
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ABSTRACT 

PURPOSE: To convert a successi^ve program for single processor to a divided 
program for multiprocessor not to execute any useless loop repetition. 

CONSTITUTION: A successive source program 101 with data dividing 
instruction is inputted, translated to an intermediate word 103, registered 
on a dictionary 104 and set to data division information 105. In program 
dividing processing 106, these elements 103-105 are inputted and deciding 
processing 107 of out-of- loop sentence execution PE and loop range 
dividing processing 108 are executed. In the processing 108, executing 
loop range deciding processing 109 for respective sentences inside the 
loop and loop range deciding processing 110 are executed. The result 
of the processing 106 is stored as program division information 111 and 
added to the intermediate -word- 103; Next/ the elements 103-105 and 111 are 
inputted, non-local data are analyzed , non-local data analysis 
information 112 is set, inter-processor communication generating processing 
113 is performed and the result is added to the intermediate word 103. 
Finally, the elements 103-105 and 111 are inputted, node program generating 
processing 114 is performed and a node program 115 for each PE is 
outputted. 
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ABSTRACT 

PURPOSE: To increase opporfi'unity to reduce the number of execution 
instructions of a loop'^^^.^bV performing the addtion of an index and 
completion discrimination by the same instruction by extending the 
applicable range of an update branch instruction in an optimizing system 
for the compiler of a computer program. 

CONSTITUTION: In a step 101, the facts that the update operation of a 
control variable in the loop is repeatedly performed once every update 

execution / that the index is updated by the same quantity every 
repetition, that the value of the control variable at a time when it goes 
out of the loop is not used anywhere, and that the reference to the 
control variable is limited to only access to an array element are checked 
In a step -102, the address. . of . the final - element is calculated, and the 
maximum value of an index equation can be found. In a step 103, -N that is 
the result of subtraction of the number N of times of repetition from an 
initial value 0 is set as the initial value of the control variable, and in 
a step 104, the access instruction of the array element is generated. In a 
step 105, the update branch instruction is generated, and a code to 

perform the completion discrimination of the loop is generated. 
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PURPOSE: To 
performing 



improve the 
optimization,' 



^ ' ^ ABSTRACT 

.Execution performance of a program by 

by which data in a multiplex loop can be 



accessed at a high speed, in a wide range . 

CONSTITUTION: With respect to the computer language processing method which 
performs the optmization processing at the time of translation to an 
object program 15 of a program 10 which is operated on a computer having 
a hierarchical memory, loops having a tight structure are detected in an 



optimization processing part 13 to recognize loops which can be blocked. 
Data overlap analysis is used to substitute loops so that the access 
distance of array data in the innermost loop is shortened. Candidate 
arrays which can be blocked are extracted, and a division object array and 
a division object index are determined from candidate arrays. The divided 
block length is so determined that arrays after division can be stored in a 
cache memory, and the loop is transformed. 
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ABSTRACT.. 

PURPOSE: To permit vectorization of a sentence group and to shorten the 
program executing time by deforming an intermediate text to a scalar 
variable having the reliance between the data conflicting with each other. 
CONSTITUTION: A sentence structure analyzing means 21 of a compiler 2 
reads a source program 1 and produces an intermediate text 26. A data 
reliance deciding means 22 of a vectorization means 4 detects all array 
elements and scalar variables out of the text 26 in regard of the pool 
structure contained in the program 1. Then the means 22 turns off the data 
reliance flag contained in^tHe^ element data and turns on the data reliance 
flag of an element table corresponding to the element having the data 
reliance conflicting with the parallel execution among those elements 
included in a loop . A sentence replacement processing means 23 performs 
the replacement of the sentences including the corresponding elements to a 
sentence group having the conflicting data reliance. A variable data 
reliance dissolving means 24 deforms the text 26 and handles the conflict 
scalar variable in the same way as the one-dimensional array element using 
a vector work area . Thus the conflict is dissolved. A code generating 
means 25 generates an object program 3 from the text 26. 
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ABSTRACT 

PURPOSE: To enlarge the objective range of parallelizing by classifying 
simple variables and arrangement in a multiple loop from the viewpoint of 
parallelizing in a parallelizing processing and introducing work 
variables and work arrangement -to a case that parallelizing* can not be 
executed conventionally . 

CONSTITUTION: A FORTRAN program 51 is converted to an intermediate word 53 
by a syntax analysis processing 52. In an intermediate processing 54, 
this word is inputted and data dependency analysis is executed. Then, a 
data dependency graph 57 is prepared and control dependency analysis is 
executed. Afterwards, a control dependency graph 58 is prepared and in the 
parallelizing processing, sections to be parallelly executed by plural 
processors are found out. Then, an optimumizing processing is executed and 
the converted intermediate word 53 is outputted. Next, in a code generation 
processing 55, a main memory is allocated to the variable and an object 
code 56 is generated. Thus, concerning the intermediate processing 54, the 
wider range is parallelized, the execution ability of the object code 
56 is improved and the objective range of parallelizing is enlarged. 
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ABSTRACT 

PURPOSE: To enable vectorizing even in a case where reference exists to the 
scalar variable of the left side of an assignment statement by allotting a 
vectoring work area to -the* regressive -operation of a recurrence formula 
can be applied. 

CONSTITUTION: When a loop structure is analyzed in the course of 
compling, and data dependence incompatible with parallel execution 
exists, a regressive operation processing part 223 is started. The 
regressive operation pattern recognizing part 2231 of the processing part 
223 investigates the structure of a formula, and detects the statement of 
the regressive operation i ^with the specified pattern of an inner 
product/total sum for the scalar variable. A definition/ reference position 
relation investigating part 2232 investigates definition/reference position 
relation in a loop for the scalar variable of the detected statement. As 
the result of the investigation, when the reference exists for the scalar 
variable of the left side of the assignment statement after the statement 
of the regressive operation, a regressive operation vectorizing processing 
part 2233 allots the vectorizing work area to this scalar variable, and 
transforms it into the statement where the regressive operation of the 
recurrence formula can be applied. 
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ABSTRACT 

PURPOSE: To enhance the parallelism between DO loops made into vectors 
and other range and to increase the efficiency of execution by grasping 
•its. data dependency in -widew range -and making the optimum- sequencing 
process . 

CONSTITUTION: Data analyzed in a source analyzing section 1 is given to 
an address allocating section 2, and there, memory area is allotted, and 
initial values are given to arrays and variables. A vector making section 3 
converts DO loops to vector instruction strings, and a sequencing 
processing section 4 grasps its data dependency in wide range / and makes 
the optimum sequencing of instruction to the data dependency by using a 
pipe line ID or synchronizing instruction. An intermediate text optimizing 
section 5 makes optimizing after making into vectors, and a register 
allotting section 6 performs processing such as allotting data to 
registers. An instruction generating section 7 converts an intermediate 
test to a machine language instruction. 
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representation to monitor for specified events 
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Abstract (Basic) : WO 200148608 A2 

NOVELTY -' Locations in* a* prbgram are * selected using changes and 
test coverage and control flow and data analysis and early warning 
codes monitoring for an event are inserted in the selected locations. 
On detecting the event, the early warning code provides an early 
warning or issues an early action. In one embodiment an early warning 
is issued if the value of an argument to a selected function is 
outside a predetermined range . Early warnings may be issued if a 
loop is executed more than a predetermined number of times or when the 
time elapsed between the y^execution of two points in the program 
exceeds a given value Qf*"if the program enters a block of untested 
code. The warning may be issued by sending an e-mail to an operator, 

DETAILED DESCRIPTION - INDEPENDENT CLAIMS are included for 

(a) a computer system for issuing an early warning 

(b) a computer program product for issuing an early warning 

(c) and a computer memory configured to provide an early warning. 
USE - In data processing systems. 

ADVANTAGE - Provides an early warning in advance of any failure or 
crash . 
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Compiler for computer, writes-in the lead address value of condition 
process, referring to which execution of loop process in a program is 
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Abstract (Basic) : JP 20001945^9^ A 

NOVELTY - The sourg^program consists of condition branch 
instruction loop , provided between a common process and condition 
process. The lead address value of the condition process is written by 
lead address setting unit (126) , referring to which execution of the 
loop is performed by jump instruction converter (127). 

DETAILED DESCRIPTION - The loop process portion is extracted from 
program by optimization unit (123) . The recording area of the 
variable that controls loop frequency is analyzed by recording 
area analysis unit (125) for selecting the area to record the lead 
address value of condition process. INDEPENDENT CLAIMS are also 
included for the following: 
la) compile procedure;*''' 



(b) compile program; 

(c) target program 

USE - In e.g. compiler for computer. 

ADVANTAGE - Execution of a loop in a program is reduced, thus 
the process time of the system is reduced, thereby reducing the 
capacity of targe,t pj;ogram,,^ . . 

DESCRIPTION OF DRAWING (S) - The figure shows the block diagram of 
compiler . 

Optimization unit (123) 

Recording area analysis unit ( 125 ) 

Lead address setting unit (126) 

Jump instruction converter (127) 
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Abstract (Basic) :. US 599^056 A, ... 

NOVELTY - An intermediate signal from two image signals, 
represented as signed 8-bit integer is obtained using registers. By bit 
wise shifting, it is checked whether the intermediate signal is 
within the range of positive and negative overflow state, and sets a 
mask signal to have 8 lower bits in OFF position or ON position, 
accordingly. 

DETAILED DESCRIPTION - The processor includes registers in graphics 
unit (38) for storing two image signals and obtaining intermediate 
signal. By bit wise shi^^tWg of intermediate signal to four positions 
to right a masking signal with a value 8, is obtained. By dividing the 
intermediate signal by 16 and storing the dividend, the processor 
determines the positive or negative overflow state. The integer 
execution unit (28) has arithmetic and logic units for performing 
arithmetic, logical and shift operations. Bit wise AND operation is 
performed on intermediate signal with mask signal to obtain translated 
data signal which is ORed bit wise with another mask signal. 

INDEPENDENT CLAIMS are also included for the following: 

(a) signed data signal optimization method; 

(b) signed data signal optimization program on a computer readable 
medium. . , . . . , ,^ ... 

USE - For checking audio and pixel data of multimedia signal in 



computer system using superscalar pipelined processor. 

ADVANTAGE - Since the resulting pixel or signed 8-bit integers 
remain within the boundaries of signed 8-bit integers, the use of 
conditional branches is eliminated, as three shift operations, two 
logic multiplication's and an addition operation is employed. Thereby, 
instruction throughput of processor is improved and efficiency of 
instruction pipelining is enhanced. 

DESCRIPTION OF DRAWING (S) - The figure shows schematic block 
diagram of superscalar pipelined processor. 
Integer execution unit (38) 
Graphics unit (38) 
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Optimum parallel execution program generation method using parallel 
conversion compiler for distributed parallel processors - involves 
outputting dynamic data division judging information based on dynamic 
information utilised for data division, according to dynamic data tally 
rule to generate code 
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Abstract (Basic) : JP 10228455 A 

The method involves inputting a source program (1) sequentially and 
performing syntactic analysis - A data division analysis is 
performed using the syntactic analysis information. A program 
division information is generated by performing division of the loop 

repetition range from the syntactic analysis information and data 
division information. A communication information is produced from 
syntactic analysis information, data division information and program 
division information. The parallel execution program is output based 
on the syntactic analysis information, data, program division 
information and communication information. 

During data division process the information sequentially utilised 
for data division is extracted from the source program. A dynamic 
information is detected'^rom the information utilised for data 
division. A dynamic data division judging information is output based 
on the detected dynamic information, according to a dynamic data tally 
rule to generate a code. 

ADVANTAGE - Improves processing speed. 
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Abstract (Basic) : JP 9152982 A 

The method involves using a performance analysis program to 
analyse a computer program at a sentence number table (13). The 
sentence number table stores the relative address and the head address 
of a source program sentence number. An execution formal program 
which translates the source program is obtained. A loop information 
table (14) stores the sentence number of a loop processing program to 
perform array operationv cprrection . The execution of the execution 
formal program is interrupted to search for a program sentence under 
execution • 

The relati ve address and the head address of the area maintaining 
the program is obtained. Based on the relative address, the sentence 
number of the program sentence under execution is obtained referring 
to the sentence number table. The modality of the program sentence is 
searched referring to the loop information table. The execution 
cost of the computer program is computed based on the loop processing 
execution frequency and the array operation. 

ADVANTAGE - Efficiently updates computer program. 
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Abstract (Basic) : JP 8305672 A 

The processing method uses a parallel conversion compiler including 
a syntax parsing unit (102), a data division and analysis device 
(103), a calculation di^fciaer (104) and a communication device (105). 
The data division device converts a program counter to a parallel 
counter for providing certain processes. An array element is assigned 
on analysing the program counter with the data division directions. 

A sentence is processed based on the relation between the rule and 
the array subscript which connect the array and a sentence and a loop 
control variable. The execution range of the sentence which is with 

range of the loop control variable is determined, a 
non-transposition section sequence processing is performed , and a 
loop duplicate processing is performed in the original loop . 
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Abstract (Basic) : US 5581762 A 

The compiling appts . includes a unit (2) for analysing a 
structure of an intermediate language program of a source program (11) 
output from a front-end unit (1), and for recognising program units 
constituting loops . A unit (3) analyses a scope of declarations 
of respective data expressions in the loops . A unit (4) statically 
analyses an overlap of two memory addresses of data expressions of 
each pair and- outputs an overlap analysis result. ... 

A unit (5) inserts an instruction to compare the two memory 
addresses of data expressions of each pair whose overlap is judged to 
be obscure, for generating a number of paths defined by combinations of 



conditions whether or not the two memory addresses of data expressions 
of each pair overlap, and for generating an instruction to branch to 
one of the number of 'pa'ths * ^^ccofding ' to a comparison result obtained by 
the instruction in an execution of a compiled program. A unit (6) 
respectively optimises the number of paths generated by the aliasing 
address comparison instruction generating unit. 

ADVANTAGE - Can carry out optimisation of data expressions whose 
overlaps are judged to be obscure. Can output more efficient object 
program. 
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The memory reorderingy^raethod involves decoding instructions 
issued by a proces55*t. If a decoded instruction is an out of 
order load instruction it is determined if the instruction 
generates an exception. A delayed exception bit associated with a 
target register of the load instruction for an out of order load 
instruction generating an exception is set. A memory address of an out 
of order load instruction not generating an exception is saved and a 
valid bit is set for this address. It is determined if a decoded 
instruction is a store operation. A range of memory addresses 
referenced by a decoded store instruction are compared with all entries 
in the address comparator. For each match the valid bit of the 
corresponding entry is set to invalid. 

It is determined "if a decoded instruction is a commit operation. 
The valid bit of the address comparator entry associated with a target 
register of the decoded commit operation. A delayed exception is 
generated if the valid bit is set to invalid. The delayed exception bit 
of a source register of the commit operation is simultaneously checked 



. If the bit is set a delayed exception is generated. An exception 
• instruction is aborted -and* control- is -transferred to an exception * 
handler. 

ADVANTAGE - Allows moving of load instructions earlier in 
execution stream allowing for arbitrary distance between reading from 
memory and using data loaded out of order. Not restricted to moving 
load operations out of loops and can tolerate ambiguous memory 
references. Provides high performance and integrates memory 
disambiguation with speculative execution . Copes with ambiguous 
store/load instructions. . ^ 
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The method uses a source program counter (110) which connects and 
assembles a printed-circuit board (510) designed to connect one or more 
other circuit boards. A test coverage information table is created by 
extracting a correction difference information (210) and an execution 
sentence information from the counter. 

Afterwards, the test coverage information implements a test 
procedure by initially se^t^ng the correction range to a 
nonperf orming state fol3?€)wed by a performing state. The execution 
path provides the regular update on the table. A test coverage 
measurement program (400) is designed to test the source program 
counter by easily monitoring the test range and the test progress 
situation. 

ADVANTAGE - Enables easy test range distinction before test 
starts and provides loop to monitor easily test progress situation 
by referring to table. 
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Abstract (Basic) : WO 9103791 A 

Objects including rules and data are stored in buffers identified 
by a buffer address. A current rule including a static data area and 
a modifiable data area is executed. The static data area stores 
object identifiers with offsets to positions in hte modifiable data 
area at which buffer addresses of buffers storing identified objects 
are to be located at execution time. 

Objects are found ina'^the buffers in response to the executing 
device, and buffer address es are supplied to the modifiable data area 
at execution time. 

ADVANTAGE - Frees programmer of explicit recognition in program of 
environmental parameters . (2pp Dwg.No. 1/21 
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Abstract (Basic): DE 3802025 C 

The buffer memory control allows command words from a main memory 
(MEM) to be loaded in the buffer memory (IBUF) in succession, with 
subsequent addressing of the stored commands via command addresses and 
transfer of the read commands to a command decoder (IDEC) . An address 
pipeline (APL) controls the addressing of the buffer memory locations 
in response to the received loading addresses (FA), providing a delay 
which corresponds to the time required by a memory access control 
(MEMG) to access the -memory^'in ' the given ' address sequence. 

The distribution address is compared with the address locations of 
the commands already loaded in the buffer memory (IBUF) to allow 
decoding of each command. 

ADVANTAGE - Allows use of large scale IC. 

1/5 
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Priority Applications (No Type Date) : US 80120059 A 19800211 

Abstract (Basic) : BE 887451 A 

To improve the operating speed of a computer, the arithmetic unit 
comprises an assembly of circuits which operate independently, each 
performing a particular task. The tasks are performed during 
successive cycles with instructions flowing in pipeline fashion on a 
bus . 1 J 

Instructions from^^memory (100) are transferred to instruction 
registers (131-4) via a common control and data bus (101). A line 
memory (105) stores variable data words via a data register (106) and 
the data bus. Control circuits (131,211,212) decode the instruction 
words for processing sub units (112,115,116) during successive cycles 
of operation. This results in a data word which is transferred to an 
output area in the line memory. Auxiliary commands used on occasions 
ensure that the register contents are preserved during execution of a 



program. 
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visualization and' genomics "dis^:overy tool "designed for the TimeLogic 
DeCypher platform. The client/server system, which works with TimeLogic' s 
DeCypher accelerators, allows bioinf ormatics researchers to tap a broad 
range of data mining and discovery features. Using VIBE 1.0, researchers 
can tap drag-and-drop sequence analysis in pipeline construction. Users 
can also apply data filtering on simple or complex criteria. VIBE 
visualizes all DeCypher algorithms, streamlining processing. VIBE also 
offers research teams distributed, multi-user support options. The 
software's interface is navigated easily. Additionally, VIBE offers results 
from active and completed ^pipelines / and it provides with interactive or 
batch mode pipeline execution options. VIBE includes the Query, Search 
NT, Search AA, Target DB, Utility, Seals, and Visualization modules. The 
Search NT module contains similarity search algorithms using a nucleic acid 
(DNA) target database. Search AA uses an amino acid (protein) database. 
vise's Target DB module contains target database icons. The Utility module 
offers notification, multiple sequence alignment, and other features. 
vise's Seals is employed in standardizing sequence analysis protocols. 
VIBE allows researchers to perform their own sequence analyses , 
reducing data processing bottlenecks. 
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Title: Adaptive loop scheduling algorithm on shared-memory systems 

Author: Jin, Canming; Yan, Yong; Zhang, Xiaodong 
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Conference Title: Proceedings of the 1996 8th IEEE Symposium on Parallel 
and Distributed Processing 

Conference Location: New Orleans, LA, USA Conference Date: 
19961023-19961026 

Sponsor : IEEE 

E.I. Conference No.: 45782 

Source: IEEE Symposium on Parallel and Distributed Processing 
Proceedings 1996. IEEE, Los Alamf'tos, CA, USA, 96TB100088 . p 250-257 
Publication Year: 1996 
CODEN: PSPDF8 ISSN: 1063-6374 
Language: English 

Document Type: CA; (Conference Article) Treatment: G; (General Review); 
T; (Theoretical) 

Journal Announcement: 9702W4 

Abstract: Using runtime information of load distributions and processor 
affinity, we propose an adaptive scheduling algorithm and its variations 
from different control mechanisms. The proposed algorithm applies different 
degrees of aggressiveness to adjust loop scheduling granularities, aiming 
at improving the execution performance of parallel loops by making 
scheduling decisions that match the real workload distributions at runtime 
. To verify the effectiveness of the algorithm and its variations, we 
implemented them on the KSR-1 and on the Convex Exemplar. We experimentally 
compared the performance of our algorithm and its variations with several 
existing scheduling algorithms on the two machines. The kernel application 
programs we used for performance evaluation were carefully selected for 
different classes of parallel loops . Our results show that using runtime 

information to adaptively adjus^^ scheduling granularity is an effective 
way to handle loops with a wid^'' range of load distributions when no 
prior knowledge of the execution can be used. The overhead caused by 
collecting runtime information is insignificant in comparison with the 
performance improvement. Our experiments show that the adaptive algorithm 
and its five variations outperformed the existing scheduling algorithms. 
(Author abstract) 7 Refs. 
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Source: Proceedings of SPIE - The International Society for Optical 
Engineering v 1454. Publ^ by Int Soc for Optical Engineering, Bellingham, 
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Publication Year: 1991 \ 

CODEN: PSISDG ISSN: 0277-7^6X 

Language: English \ y/^ 
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Abstract: A two-axis, high-bandwidth, small^perture steering mirror 
called the High Bandwidth Steering kirror (HgBM) has been designed, 
fabricated, and tested . The mirrorAjnechandsm prototype functions within 
a servo loop either scanning a fi el a. of/View or tracking a radiation 
source. The design focused on elementsNjnaking up the beam-steering 
mechanism: mirror, restoring f le^4^res, /^tuators , position sensors, and 
encompassing housing, and the part ea^ Vjmponent plays in making a 
mechanical system suitable for highyOandwMth operation. Inclusion of a 
novel flexural support allows one-^gree peak-to-peak angular stroke (shaft 
space) at low frequencies and a amall-signaPv closed- loop bandwidth of up 
to 10 kHz without the usual mecjatanical resonance-induced loop 
instabilities. This increased Joandwidth allows\substantial rejection of a 
disturbance spectrum in the 1/0-1000 Hz range and execution of fast, 
complex scan patterns. Poin^ng accuracies of 0.2\mu rad have been achieved 
in the laboratory. Details/of the mechanical desi-gn and fabrication issues 
as well as the control- loipp implementation are discussed. Test data are 
presented along with reports of the mirror's performance in use as an 
extended sensor. (Author abstract) 7 Refs. 
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Abstract: The run - time performance of a strategy (previously proposed 
by the authors) for processing recursive queries in deductive database 
systems is studied. The algorithm, introduced informally by examples, is 
coded in occam2, and runs on a network of transputers. A wide range of 
recursive queries and database structures are used as benchmarks. Both the 
speedup factors achieved and the elapsed time spent by the strategy in 
answering recursive queries are analyzed . Experimental results show that 
it is possible to achieve significant performance improvements when queries 
are evaluated in parallel. These results provide insights into the success 
of this strategy in meeting the primary objective of focusing on relevant 
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When a control loop is closed via a serial communication channel, it 
is labeled a Networked Control System (NCS) . The primary objective of NCS 
design is to efficiently use the finite bus capacity while maintaining good 
closed- loop control system performance . In this dissertation, the focus 
is on the problems related to scheduling network traffic. The controller is 
assumed to be designed in advancq without considering the effect of the 
network. Various scheduling schei^s are then proposed, analytically 
verified and compared via simulations with fixed controller designs. Here, 
the primary interest lies in the dynamic scheduling of network traffic; 
that is, developing protocols that decide at runtime which channel to 
transmit and which to block. 

For wireline NCS, a new protocol called Try-Once-Discard (TOD) is 
introduced. This new protocol dynamically allocates network resources to 
information sources based on need. An analytical stability condition for 
wireline NCS with both single-packet and multiple-packet transmission is 
derived. The condition is valid for both dynamic scheduling (e.g., TOD) and 
static scheduling algorithms (e.g., token passing). Simulation results 
verify the analytical prediction and demonstrate that dynamic scheduling 
outperforms static scheduling." 

For real-time traffic scheduling of the wireless NCS, we propose a new 
protocol - Prioritized CSMA/CA is proposed; this is based on the definition 
of IEEE 802.11 and the requirements of real-time control and monitoring. 
For the first time, several algorithms for scheduling the traffic of 
wireless NCS have been proposed and validated; that is, constant penalty 
scheme, estimated error order scheme and lag first order scheme. All 
algorithms (include static scheduler) are compared with each other via 
simulation and the results show l^ain that dynamic scheduling outperforms 
static scheduling (e.g., fixed order polling). 

Several experiments were conducted to verify the former theoretical 
analysis and simulation results. In the network-only experiment, plant and 
controller dynamics are simulated via a digital computer; sensor nodes are 
realized by smart distributed I/O board with a controller area network 
(CAN) interface; and the communication channel is realized by CAN hardware. 
The result shows that dynamic scheduling (TOD) outperforms static 
scheduling in a real network environment allowing multiple packet 
transmission. The theoretical and numerical results of NCS with one packet 
transmission is further validated by the wireline SISO system experiment on 
the process trainer PT326. Experiments on the wireless MIMO system ensures 
good system performance even lihder worst case; that is, with a static 
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Performance estimation of coirtouter systems an important topic to a 
large number of people in the computer industry/ Computer architects need 
to be able to study future machines \ compilerywriters need to be able to 
evaluate the compiler output before ^machin^ exists , and developers need 
insight into the machine's perfo$jnance\^in ofder to tune their code. There 
are many performance estimation Cechnicmes''^ that range from profile-based 
approaches to full machine simulation, afe^tailed simulation is one of the 
most common methods for estimating performance. It suffers, however, from 
potentially long run times when simula^ing\large applications using 
detailed processor models. / \ 

This thesis discusses a profiler-based performance estimation technique 
that uses a lightweight instrumentation phase \hat runs in order number of 
dynamic instructions, followed bv/an analysis \phase that runs in order 
number of static instructions. %n\s technique accurately predicts the 
performance of several pipelines including a detailed out-of-order issue 
processor model while scheduli/ng far fewer instructions than does full 
simulation. The difference between the predicted execution^ time and the 
time obtained from full simulation is only a few percent. An extension to 
the basic technique accurately predicts branch prediction and instruction 
cache effects, but fails yo handle data cache effects. Reasons for this 
failure are given. This thesis illustrates how this approach improves on 
earlier profile based analysis methods especially for the more advanced 
processor pipelines' and illustrates how future processor trends will need 
new approaches . 
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Recent research in techniqu§^ for optimizing parallel input/output has 
shown that matching available fife system policies to application access 
patterns and resource requirements can produce substantial performance 
improvements. However, parallel file system policy availability varies 
significantly across computation platforms. Similarly, parallel scientific 
access patterns and resource requirements vary not only across applications 
but also among phases within an application. Unfortunately, the potential 
range of policies and access patterns make an exhaustive search of the 
policy space for optimal combinations prohibitively expensive. 

We propose a twp phase optimization strategy that first applies factor 
analysis to identify the most effective policy combinations on a given 
platform. Second, at run time , it performs closed loop control to 
refine policies. Rather than requiring potentially millions of experiments 



to. identify an effective 'optimal' policy combination, factor analysis 
uses a reasonable number of experiments to identify an effective, though 
not provably optimal, policy. Having identified effective policies on a 
given platform, we automatically monitor access patterns and system 
performance at application run time and refine file system policy 
parameters as bottlenecks become evident. 

We have impTemented this approach via several extensions to the 
Portable Parallel File System (PPFS) testbed . Sensor metrics that 
summarize dynamic access pattern information and file system performance 
effectively guide adaptive policy selection, affording significant 
performance improvements over less adaptive optimization strategies. Our 
results with both parallel access pattern benchmarks and parallel 
scientific applications demonstrate the efficacy of our approach. 
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Tightly predicting worst-case execution times (WCETs) of programs on 
real-time systems with caches is difficult. Whether or not a particular 
reference is in cache depends on the program's previous dynamic behavior. 
While much progress has been acc^plished recently on predicting 
instruction cache performance of programs, bounding worst-case data cache 
performance is significantly more challenging. Unlike instruction caching, 
many of the data addresses referenced by load and store instructions can 
change during the execution of a program. This dissertation describes an 
automatic tool-based approach for statically bounding the worst-case data 
cache performance .of large code segments. It also presents the work done to 

verify the validity of the computed bounds. The given approach works on 
fully optimized code, performs the analysis over the entire control flow 
of a program, detects and exploits both spatial and temporal locality 
within data references, produces results typically within a few seconds, 
and produces, on average, 30% tighter WCET bounds than can be predicted 
without analyzirig data cache- behavior . - * 

The given method of timing analysis involves several steps. First, 
data flow analysis within an optimizing compiler is used to determine the 
bounded range of addresses of each data reference relative to .a global 
symbol or activation record. Second, virtual address ranges are calculated 
from the relative address ranges by examining the order of the assembly 
data declarations and the call graph of the entire program. Third, the 
control flow of the program is analyzed to statically categorize the 
caching behavior of each data refference. Fourth, these categorizations are 
used when calculating the pipeline performance of sequences of 
instructions representing paths within the program. Finally, the pipeline 
path analysis is used to estimate the worst-case execution performance 

of each loop and function in the program. 

Overall, this dissertation presents a comprehensive report on methods 
and results of worst-case timing analysis of data cache behavior and 
shows that such an analysis can lead to a significantly tighter 
worst-case performance prediction. The given approach is unique and 
provides a considerable step towards realistic worst-case execution time 
prediction of contemporary architectures and its use in schedulability 
analysis for real-time systems. 
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High performance Fortran (HPF) , as well as its predecessor FortranD, 
has attracted considerable attention as a promising language for writing 
portable parallel programs for a wide variety of distributed-memory 
architectures. Programmers express data parallelism using Fortran90 array 
operations and use data layout- directives to direct the partitioning of- the- 
data and computation among the processors of a parallel machine. 

For HPF to gain acceptance as a vehicle for parallel scientific 
programming, it must achieve high performance on problems for which it is 
well suited. To achieve high performance with an HPF program on a 
distributed-memory parallel machine, an HPF compiler must do a superb job 
of translating Fortran90 data-parallel array constructs into an efficient 
sequence of operations that minimize the overhead associated with data 
movement and also maximize data J^jbcality. 

This dissertation presents and analyzes a set of advanced 
optimizations designed to improve the execution performance of HPF 
programs on distributed-memory architectures. Presented is a methodology 
for performing deep analysis of Fortran90 programs, eliminating the 
reliance upon pattern matching to drive the optimizations as is done in 
many Fortran90 compilers. The optimizations address the overhead of data 
movement, both interprocessor and intraprocessor movement, that results 
from the translation of Fortran90 array constructs. Additional 
optimizations address the issues of scalarizing array assignment 
statements, loop fusion, and data locality. The combination of these 
optimizations results in a compiler that is capable of optimizing dense 
matrix stencil computations more completely than all previous efforts in 
this area . This work is distinguished by advanced compile-time analysis 
and optimizations performed at the whole-array level as opposed to 
analysis and optimization performed at the loop or array-element 
levels . 
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An exploration of a microprocessor architecture for Digital Signal 
Processing is presented. The exploration methodology is similar to the 
analyses used in the design of Reduced Instruction Set Computers. 



. The proposed architecture is compatible with the Motorola DSP56000 
microprocessor at the assembly- code level, but employs - greater pipelining 
to improve performance . High-level simulation of the 56000 and the 
intermediate experimental architectures is used to characterize the 
execution of a suite of benchmark programs. The profiling data is used to 
reorganize the architecture to improve performance. 

The proposed architecture is shown to improve the performance by a 
factor in the range of 1.25 to 2.11 across the benchmarks on a per clock 
basis . 

An implementation proposal ipr the architecture is presented that 
enumerates the additional resources required by the proposed architecture 
and includes floorplans for a VLSI implementation. 
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P. 

Special computer architectures for design automation (DA) problems are 
a practical solution to manage the increasing complexity of designing large 
integrated systems. A class of cellular architectures is studied which is 
applicable to physical DA problems that are cellular in nature: problems 
well-represented on a cellular grid with strongly local functional 
dependencies. The Raster Pipeline Subarray (RPS) class is a systolic 
organization, the central features of which evolved historically in 
classical picture-processing applications. RPS-structured DA engines have 
several attractive engineering properties: transparent expansion of 
processing capacity with a linear pipeline , direct accommodation of large 
grids with the raster data format, and application to differing tasks with 
programmable pipeline stages-. - - - 

This thesis studies routing and integrated circuit design rule 
checking in an RPS environment. An experimental hardware/software RPS 
environment is constructed around existing RPS hardware; tools are 
developed to support the design and debugging of large-scale RPS-based DA 
systems. Maze-routing is the major application of interest; a mapping of 
maze-routing algorithms onto an RPS pipeline is developed and its 
complexity analyzed . A progression of increasingly complex, fully 
functional routers is implementec^.in the prototype RPS environment to 
verify feasibility. Large-scale benchmarks and comparisons with software 
routers show significant speedups . Design rule checking is considered 
from an algorithmic viewpoint: it is shown that a formalism derived from 
picture-processing tasks provides an elegant conceptual and notational tool 
for mapping rules checking onto an RPS pipeline . Hardware 
implementations of some specifications are analyzed . These studies prove 
that existing RPS engines can be improved incrementally and gracefully: 
additional pipeline stages improve execution times. Experiments over a 
range of pipeline lengths, and extrapolations based on experimental 
measurements support this claim. From these experiments, detailed 
performance models and cost/performance metrics are developed, treated 
analytically, and rigorously optimized. These results are -employed to 
deduce the necessary functionality and tradeoffs to design optimal 
RPS-structured DA engines. 
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This dissertation presents ^^^solution to the collision problem of 
dynamic pipelines . A dynamic pipeline can execute various algorithms 
each of which will be executing different functions as defined by the 
algorithm's flow graph and the data being input. Dynamic pipelines 
therefore have the problem of collisions between the algorithms as they 
each try to execute the various functions. This collision problem is 
currently circumvented by requiring that the pipelines be synchronous. 
All functions must execute in a fixed amount of time. This dissertation 
describes a technique for building asynchronous pipelines . These 
asynchronous pipelines are also distributed pipelines in that the 
functions to be executed are distributed over the processors within the 
pipeline . A function can be executed in more than one processor, but 
once a processor .starts to execute the function, it will completely execute, 
that function. 

Since the processors (which are themselves pipelined ) can not 
execute every function, they are individually built to execute only those 
functions assigned to them. The pipeline itself is restricted to a 
specific area of applications such as signals analysis . 

The manner in which the functions are distributed to the 
processors within he pipeline is by first calculating a utility 
coefficient for each function. T^e utility coefficients are then used by a 
distributed algorithm to distribute the functions to the processors. 
Security processors may be added to the pipeline to both improve 
throughput and enhance flexibility of the pipeline . Security processors 
allow for the arrival of functions that have execution distributions that 
differ from what was previously specified. The utility coefficient can also 
be used to calculate the maximum arrival rate (of functions ) that the 
pipeline can accept. Given the actual arrival rate, the utility 
coefficients can be used to calculate the average contents of the pipeline 
, and also the average contents of each processor within the pipeline . 
Since the utility coefficients are fairly easy to determine (and are the 
basic tool in designing the pipeline ) the pipeline designer is able to 
both build and study (through .simulated models by using a, simulator given 
in this dissertation) various asynchronous distributed pipelines without 
difficulty. 
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Several new algorithms for use for enhancing the performance of 
pipelined digital computers have been developed and evaluated. The design 
of a particular such. computer embodying most of these algorithms is 
discussed in detail — the S-1 Mark IIA. The relationship and importance of 
the new algorithms to the overall performance of such a machine is 

analyzed . . , - . 

An algorithm for the very rapid pipelined computation of medium 
precision approximations (about 30 bits for the Mark IIA) to elementary 
functions is described. This method uses table lookup and two parallel 
multiplications to triple the precision available from direct table lookup. 
Current RAM technology permits the effective use of this algorithm for 
non-trivial word sizes. The method is applied to reciprocal, square-root, 
exponential, logarithm, arctangent, sine, cosine, and the error function. 

A floating-point additigp algorithm which has a much shorter 
latency than previous approaches '*''is developed and analyzed . This 
algorithm lends itself to the efficient simultaneous calculation of 
floating-point sums and differences, which is of great value computing FFTs 
and to other related algorithms. The algorithm resolves floating-point 
addition into one of two independent cases, each of which can be 
implemented in fewer logic gate delays than previous algorithms. 

Previous techniques of sorting on pipelined machines are 
analyzed and a new algorithm based on Quicksort is developed. This new 
algorithm is significantly faster and simpler than previous pipelined 
sorting techniques . 

The use of skewed data representations to increase the performance 
of interleaved memories for many algorithms is well known. However a large 
price is paid in convenience by the use of such techniques. A new approach 
which allows the use of normal data representations but which has all of 
the performance advantages of the skewed representations is described. This 
technique is particularly valuable since the hardware used to implement it 
can also serve as a queue to minimize the effects of temporary stoppages in 
the instruction and operand fetching and arithmetic execution hardware. 

Several other new techniques for performance enhancement are also 
described and analyzed , and fry^tful directions for future work in this 
area are discussed. 
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Abstract: A discussion of the regulations and safety measures regarding 

construction or repair worky^'ri, or in the vrcinity of, an operating oil 

pipeline , which could darriage the pipeline \c overs administrative 
procedures that should pre;tede the execution \pf the work; general 
rules for the construction of oil or gas pipelines , cables, tunnels 
and the like in parallel/ or crossing an existing \pipeline or of 
railroads or highways alDove a pipeline ; and reco^endations for the 
protection of the existing pipelines during the \jork , including 
methods for locating and uncovering buried pipelined and protecting 
the work area. , the use of explosives, and examin^ion of the 
pipeline and restoration of the area after the completion of the 
work . 
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Abstract: In this paper, we describe a digital pulse-width modulator 
(DPWM) capable of 8-bit resolution at the switching frequency of 1 MHz. The 
DPWM chip, which has been designed and fabricated in a standard digital' 
CMOS process, has novel architecture and takes relatively small silicon 

area . The power consumption is also relatively low. The potential for 
high performance and low cost digital realization opens the possibility of 
outperforming standard analog controllers, while adding an array of 
advantages of digital technology: higher production yield, design-time or 

run - time programmability , reduction or elimination of external tuning 
components, implementation of ^ more sophisticated control algorithms, 
improved system integration aild diagnostics, etc. The DPWM chip has been 
experimentally tested . A low-power 1 MHz synchronous buck converter is 

operated as a closed- loop voltage regulator with a controller that 
consists of the DPWM chip and an Analog Devices ADMC-401 DSP system. The 
architecture and operation of the DPWM chip are described. The complete 
closed- loop controller implementation and experimental results are 
presented. (5 Refs) 
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Abstract: There is growing interest in studying the possibility of 
reconf igurable architectures as replacements for general purpose computing 
for certain application domains. Reconf igurable systems can take advantage 
of deep computational pipelines , perform concurrent execution and are 
inherently data flow in nature. Furthermore, these systems have the 
capability of 'on the fly' reconfiguration of all or portions of the 
hardware to represent all the functionality required to complete the 
execution of an application. However, these architectures suffer from 
slow run ti-me reconfiguration (RTR) due to the ^ f act -that . the- 

configuration memory resides off-chip and hence requires high access 
latency. This disadvantage limits the system performance and the 
application domain in which reconf igurable systems could prove effective. 
To overcome slow RTR, recent approaches include on-chip configuration 
memory to cache the next possible configurations. This approach trades off 
die area for fast RTR which diminishes the processing power of the 
reconf igurable processor. The high cost of adding configuration cache, up 
to 50^ of the die area / iJvWould considerably increase the number of 
hardware reconfigurations required compared to architectures without 
on-chip cache. This paper presents an alternative reconf igurable 
architecture which overcomes these limitations by exploiting high bandwidth 
optical channels. We develop a performance model to analyze and compare 
the performance of cache based RTR architectures, optical based RTR 
architectures and hybrid optical-cache based RTR architectures. (4 Refs) 
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Abstract: The design process of an 8-bit 32-processing-units serial 
pipeline VLSI modular processor performing the parallel execution of 
a kernel classifier recall phase is described. Starting from VHDL 
high-level modelling, after verifying the correct circuit behaviour by 
means of digital simulation, the circuit has been synthesized and mapped on 
a 0.7 micron CMOS technology. Both area and delay optimized syntheses are 
performed for each- processor cell, selecting in each case the best 
solution. Taking advantage of the constant data flow of the pipeline 
architecture, a dynamic realization of the memory elements using tri-state 
standard cells .is proposed... This reduces both the circuit area . and. 
delays, without losing the convenience of an automatic standard cell 
placement and routing. From synthesis results, a working frequency of about 
300 MHz is expected. A parallel-serial interface reduces external clock 
frequency requirements, and thus matching the external frequency 
limitations with fast on-chip processing. (11 Refs) 
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Abstract: Using runtime information of load distributions and processor 
affinity, we propose an adaptive scheduling algorithm and its variations 
from different control mechanisms. The proposed algorithm applies different 
degrees of aggressiveness to adjust loop scheduling granularities, aiming 
at improving the execution ^^perf ormance of parallel loops by making 
scheduling decisions that match trie real workload distributions at runtime 
To verify the effectiveness of the algorithm and its variations, we 
implemented them on the KSR-1 and on the Convex Exemplar. We experimentally 
compared the performance of our algorithm and its variations with several 
existing scheduling algorithms on the two machines. The kernel application 
programs we used for performance evaluation were carefully selected for 
different classes of parallel loops . Our results show that using runtime 
information to adaptively adjust scheduling granularity is an effective 
way to handle loops with a wide range of load distributions when no 
prior knowledge of the execution can be used. The overhead caused by 
collecting runtime information is insignificant in comparison with the 
performance improvement. Our experiments show that the adaptive algorithm 
and its five variations outperformed the existing scheduling algorithms. ( 
7 Refs) 
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Abstract: We 'have previously presented the architecture and basic 
analytic results and for a functional 1-D pipelined hybrid 
optical/digital processing concept capable of generating a target range 
-Doppler profile in real time. Here we address the fundamental system 
processing algorithm and hardware development issues in some detail. The 
approach to performing real-time phase correction of the individual range 
profiles is outlined, along with the basic system operational runtime 



algorithms and system processin^i^" pipeline . A description of the receiver 
hardware and its component functionality in terms of the presented 
operational theory is given as well. (3 Refs) 
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Abstract: For the design of pipelined systems satisfying given area 
or time constraints, high level synthesis techniques are employed to allow 
the users to explore design space. This paper proposes a heuristic 
algorithm that performs pipeline scheduling under area constraints. 
The proposed algorithm generates the design satisfying a given area 
constraint, while optimizing the overall execution time. The algorithm 
tries to distribute to each ^^partition evenly by employing the list 
scheduling algorithm enhanced "^with delay insertion capability. It also 
supports chaining, multi-cycling, and mutual exclusion. Experimental 
results show that the proposed algorithm generates an optimal design for 
several benchmark programs. (12 Refs) 
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Descriptors: circuit layout CAD; delays; heuristic programming; pipeline 
processing; program testing ; scheduling 

Identifiers: area -constrained scheduling algorithm; synthesis of 
pipelined systems; time constraints; high level synthesis techniques; 
heuristic algorithm; pipeline scheduling; area constraints; list 
scheduling algorithm; delay insertion capability; chaining; multicycling; 
mutual exclusion; optimal design; benchmark programs; circuit layout CAD 

Class Codes: C5'220P (Parallel architecture); C7410D (Electronic' 
engineering) ; C5440 (Multiprocessor systems and techniques) ; C6150G ( 
Diagnostic, testing, debugging and evaluating systems) 
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4506070 INSPEC Abstract Number: C9312-6130B-005 
Title: Interactive volume rendering of large fields 

Author(s): Sakas, G. 

Author Affiliation: Fraunhof er-Inst . for Comput. Graphics, Darmstadt, 
Germany 

Journal: Visual Computer vol.9, no. 8 p. 425-38 



Publication Date: Aug. 1993 Country of Publication: West Germany 
CODEN: VIC0E5 ISSN: 0178-2789 

Language: English Document Type: Journal Paper (JP) 
Treatment: Practical (P) 

Abstract: Presents the volume-rendering pipeline and the most typical 
of the existing methods for each pipeline stage. The complexity of each 
stage in terms of computing time is analyzed for each method. Then the 
demands and the scope of interactive volume rendering are briefly 
summarized. Based on this analysis , the author examines alternate 
solutions to optimize each pipeline stage in order to allow interactive 
visualization while maintaining the image quality. The proposed method 
maximizes interactive manipulation possibilities and minimizes runtimes by- 
sampling at the Nyquist rate and by flexibly trading off quality for 
performance at any pipeline level. This approach is suitable for 
rendering large, scalar, discrete volume fields such as semi-transparent 
clouds (or X-rays) on the fly. (39 Refs) 

Subfile: C 

Descriptors: clouds; data visualisation; interactive systems; natural 
sciences computing; Nyquist criterion; optimisation; pipeline processing; 
rendering (computer graphics); X^^ays 

Identifiers: interactive volume rendering; discrete scalar volume fields; 
pipeline ; computing time; image quality; interactive manipulation; 
runtime minimization; Nyquist sampling; performance; semitransparent 
clouds; X-rays; large fields; scientific visualization 

Class Codes: C6130B (Graphics techniques); C7300 (Natural sciences) 
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Seismic reliability of buried pipeline networks. No. 3\ 

HAGIO AKIRA (1); SUZUKI NOBuklSA (1); OOBA SYUUKOU (1)/ KAMEMURA TOSHIHIKO 

(2); SUZUKI ISAMU (2) \ / 

(1) Nihonkokan Jukoken; (2) Nippon Kokan K.K. / 

Nippon Kokan Giho (Nippon Kokan ^^echnical Report) , ^987, NO. 116, PAGE. 53-60 

, FIG. 14, TBL.7, REF.ll \ Y 

JOURNAL NUMBER: F0229AAG ISSN NO: 0468-2815 / CODEN: NPKGA 

UNIVERSAL DECIMAL CLASSIFICATION!?)^. 62X. 14/ . 15+^8 . 2 62 4 . 13 1 . 53/ . 55 

LANGUAGE: Japanese COUNTRY OF^UBLI CATION : Japan 

DOCUMENT TYPE: Journal \ / 

ARTICLE TYPE: Original paper 

MEDIA TYPE: Printed Publication /\ 

ABSTRACT: Probabilistic approach has been in^e for simulations of 
restoration works of a damaged /pipelirite network for gas 
distribution service by introduoang the Markov process. A new 
approximate solution technique / Subnetwork M^^hod, is proposed in order 
to evaluate connectivity betwefen a supply node^ and a demand node, and 
quite effective for reductioi/ of the computer isun time . The damages 

analyzed are those of Sendai City caused by the\Miyagiken-oki 
earthquake in 1978. The simulations conducted in tni's-^aper deal with 
the process of recovery of/ the network system from different aspects. 
They are evaluation of reqovery rate of some subdivided areas and 
connectivity between the designated nodes during the restoration works. 
In consequence of the results of the simulations, it can be recommended 
that the restoration works should be started from the subdivided area 
in which the least amount of damages are observed. (author abst.) 

DESCRIPTORS: pipeline (system); lifeline; seismic analysis ; mathematical 
model; gas service industry ; i^ifeeismic damage; disaster recovery; 
block (object ) ; gas supply system; Markov process; CPM; network; Miyagi; 
buried pipe; town gas; reliability (property) ; evaluation; earthquake 

BROADER DESCRIPTORS: pipe line; facility and building; structure analysis 
; analysis ; model; industry; natural disaster; disaster; disaster 
countermeasure; countermeasure ; recovery and reclamation; system; 
stochastic process; process; network programming; operations research; 
Tohoku District; Japan; East Asia; Asia; pipe classified by application 
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SYMANTEC DELIVERS IMPROVED SPEED PERFORMANCE IN ENTERPRISE DEVELOPER 2.5 

DATE: October 2, 1995 07:57 EDT WORD COUNT: 527 

...database engines;. a new report generator. Crystal 
Reports; Informix Data Link; and a message customizer. 

In benchmark testing , Coordinated Digital Systems, a Charlotte-based 
consulting group/ found Enterprise Developer to be substantially faster 
than Version 2.0 in the main tested categories. "The browse performance 
was particularly impressive," said Pete Griffiths, president of CDS. 
"This is the area that is critical to our application development and 
Enterprise Developer performed better than any other product we 
evaluated." The results of these tests indicated that the query speed 
retrieval rates increased 500%, from 2,860 to 15,400 records in 
60 seconds and SCALEScript exec\i,fion speed doubled, resulting in a 
50% speed reduction time for loop testing . 

"We are very pleased with this new version of Enterprise Developer, " 
said Ted Schlein, vice president of... 
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04532168 Supplier Number: 46659432 (USE FORMAT 7 FOR FULLTEXT) 
Single -board Machine Vision for OEMs 

News Release, pN/A 
August 28, 1996 

Language; English Record Type: Fulltext 
Document Type: Magazine/ Journal ; Trade 
Word Count: 542 

(USE FORMAT 7 FOR FULLTEXT) 
TEXT : 

...is capable of issuing and retiring as many as three instructions per 
clock cycle, using five independent execution units. The 603e can execute 
a single precision floating-point multiply and add operation in every clock 



...operating system that supports multi-tasking and yields fast context 
switching tunes. mvPower includes Datacube*s powerful pipeline processing 
architecture, which pipes video data through a configurable series of 
sequentially connected computational elements. Each of these elements 
optimizes the performance of a specific image processing task. Multiple 
parallel pipelines work synchronously to further enhance data 
throughput. mvPower's AU element a built-in arithmetic processor and 
crosspoint. . . 

...three VSIM memories which, in addition to storing images in memory, are 
capable of certain image processing functions . Each of Datacube's VSIM 
modules has a crosspoint switch, statistical processor, LUT, ALU, and 
feature extractor... 

. . .MaxACQ modules provide high tljroughput and simplify integration for a 
variety of analog and digital sensors including area , line-scan, and TDI 



types in a range of resolutions. The basic MaxACQ module on mvPower has 
two 4: I camera multiplexers and supports simultaneous... 

...easy-to-use, the tools support image acquisition, preprocessing, object 
finding (normalized gray-scale correlation or blob analysis ) / metrology, 
and camera calibration. The stancj-alone mvPower image processing engine 
simplifies integration and improves performance in a wide variety of 
applications including SMD placement, wire bonding, die bonding, robot 
guidance, label inspection , and more. Dr. Scott Roth, Vice President, 
Datacube's Machine Vision Group, explains "our target market for mvPower is 
OEMs who need motion guidance, accurate measurement , and fast inspection 

in their machines, and our volume pricing has been set to attract them." 
Founded in 1979, Datacube... 

...of image processing hardware, software, and systems designed to meet 
technology and business requirements of a wide range of scientific and 
commercial users. Datacube is a primary supplier to Fortune 100 firms and 
major research. . . 
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02246882 

Computer Products Introduces LAN Compatible RTP 

News Release March 2, 1989 phi 

Computer Products, Inc. Measurement & Control Division has introduced 
an intelligent network product which interfaces their analog and digital 
I/O product family, RTP, with Local Area Network environments. Named G-2 
LAN Compatible RTP, it will offload a host computer by providing full 
preprocessing capability, including such features as scanning, engineering 
units conversion, calculations, sequence of events, and closed loop 
control. G-2 will initially operate in a DECnet/Ethernet environment. 
Based on Motorola 68620/68881 technology, G-2 is a 19-inch. . . 

... aocommodation for up to three Input/Output Bus Converters. The G-2 
processor card 'combines a high performance 68020 'CPU with a 6B881" 
Floating Point Coprocessor, each running at 16 MHz, for extremely fast 
instruction execution . The 256 Kbyte ROM contains the Executive and 
Database Compiler while the Run Time code and Sequence of Events data 
are stored in 1 Mbyte of onboard dynamic RAM. 



